Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary
Identifieur interne : 000017 ( Main/Exploration ); précédent : 000016; suivant : 000018Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary
Auteurs : Franck Sajous [France] ; Emmanuel Navarro [France] ; Bruno Gaume [France] ; Laurent Prévot [France] ; Yannick Chudy [France]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2010.
Abstract
Abstract: The lack of large-scale, freely available and durable lexical resources, and the consequences for NLP, is widely acknowledged but the attempts to cope with usual bottlenecks preventing their development often result in dead-ends. This article introduces a language-independent, semi-automatic and endogenous method for enriching lexical resources, based on collaborative editing and random walks through existing lexical relationships, and shows how this approach enables us to overcome recurrent impediments. It compares the impact of using different data sources and similarity measures on the task of improving synonymy networks. Finally, it defines an architecture for applying the presented method to Wiktionary and explains how it has been implemented.
Url:
DOI: 10.1007/978-3-642-14770-8_37
Affiliations:
- France
- Midi-Pyrénées, Occitanie (région administrative), Provence-Alpes-Côte d'Azur
- Marseille, Toulouse
- Aix-Marseille Université, Université de Provence
Links toward previous steps (curation, corpus...)
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct:series"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary</title>
<author><name sortKey="Sajous, Franck" sort="Sajous, Franck" uniqKey="Sajous F" first="Franck" last="Sajous">Franck Sajous</name>
</author>
<author><name sortKey="Navarro, Emmanuel" sort="Navarro, Emmanuel" uniqKey="Navarro E" first="Emmanuel" last="Navarro">Emmanuel Navarro</name>
</author>
<author><name sortKey="Gaume, Bruno" sort="Gaume, Bruno" uniqKey="Gaume B" first="Bruno" last="Gaume">Bruno Gaume</name>
</author>
<author><name sortKey="Prevot, Laurent" sort="Prevot, Laurent" uniqKey="Prevot L" first="Laurent" last="Prévot">Laurent Prévot</name>
</author>
<author><name sortKey="Chudy, Yannick" sort="Chudy, Yannick" uniqKey="Chudy Y" first="Yannick" last="Chudy">Yannick Chudy</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:90792A953C1E3C5D11D0476C5E6873AE5BF8D8B1</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1007/978-3-642-14770-8_37</idno>
<idno type="url">https://api.istex.fr/document/90792A953C1E3C5D11D0476C5E6873AE5BF8D8B1/fulltext/pdf</idno>
<idno type="wicri:Area/Main/Corpus">000184</idno>
<idno type="wicri:Area/Main/Curation">000162</idno>
<idno type="wicri:Area/Main/Exploration">000017</idno>
<idno type="wicri:explorRef" wicri:stream="Main" wicri:step="Exploration">000017</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary</title>
<author><name sortKey="Sajous, Franck" sort="Sajous, Franck" uniqKey="Sajous F" first="Franck" last="Sajous">Franck Sajous</name>
<affiliation wicri:level="3"><country>France</country>
<placeName><settlement type="city">Toulouse</settlement>
<region type="region" nuts="2">Occitanie (région administrative)</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
</placeName>
<wicri:orgArea>CLLE-ERSS</wicri:orgArea>
</affiliation>
</author>
<author><name sortKey="Navarro, Emmanuel" sort="Navarro, Emmanuel" uniqKey="Navarro E" first="Emmanuel" last="Navarro">Emmanuel Navarro</name>
<affiliation wicri:level="3"><country>France</country>
<placeName><settlement type="city">Toulouse</settlement>
<region type="region" nuts="2">Occitanie (région administrative)</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
</placeName>
<wicri:orgArea>IRIT</wicri:orgArea>
</affiliation>
</author>
<author><name sortKey="Gaume, Bruno" sort="Gaume, Bruno" uniqKey="Gaume B" first="Bruno" last="Gaume">Bruno Gaume</name>
<affiliation wicri:level="3"><country>France</country>
<placeName><settlement type="city">Toulouse</settlement>
<region type="region" nuts="2">Occitanie (région administrative)</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
</placeName>
<wicri:orgArea>CLLE-ERSS</wicri:orgArea>
</affiliation>
</author>
<author><name sortKey="Prevot, Laurent" sort="Prevot, Laurent" uniqKey="Prevot L" first="Laurent" last="Prévot">Laurent Prévot</name>
<affiliation wicri:level="4"><country>France</country>
<placeName><settlement type="city">Marseille</settlement>
<region type="region" nuts="2">Provence-Alpes-Côte d'Azur</region>
</placeName>
<orgName type="university">Université de Provence</orgName>
<orgName type="institution" wicri:auto="newGroup">Aix-Marseille Université</orgName>
</affiliation>
</author>
<author><name sortKey="Chudy, Yannick" sort="Chudy, Yannick" uniqKey="Chudy Y" first="Yannick" last="Chudy">Yannick Chudy</name>
<affiliation wicri:level="3"><country>France</country>
<placeName><settlement type="city">Toulouse</settlement>
<region type="region" nuts="2">Occitanie (région administrative)</region>
<region type="old region" nuts="2">Midi-Pyrénées</region>
</placeName>
<wicri:orgArea>CLLE-ERSS</wicri:orgArea>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2010</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">90792A953C1E3C5D11D0476C5E6873AE5BF8D8B1</idno>
<idno type="DOI">10.1007/978-3-642-14770-8_37</idno>
<idno type="ChapterID">37</idno>
<idno type="ChapterID">Chap37</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: The lack of large-scale, freely available and durable lexical resources, and the consequences for NLP, is widely acknowledged but the attempts to cope with usual bottlenecks preventing their development often result in dead-ends. This article introduces a language-independent, semi-automatic and endogenous method for enriching lexical resources, based on collaborative editing and random walks through existing lexical relationships, and shows how this approach enables us to overcome recurrent impediments. It compares the impact of using different data sources and similarity measures on the task of improving synonymy networks. Finally, it defines an architecture for applying the presented method to Wiktionary and explains how it has been implemented.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
<region><li>Midi-Pyrénées</li>
<li>Occitanie (région administrative)</li>
<li>Provence-Alpes-Côte d'Azur</li>
</region>
<settlement><li>Marseille</li>
<li>Toulouse</li>
</settlement>
<orgName><li>Aix-Marseille Université</li>
<li>Université de Provence</li>
</orgName>
</list>
<tree><country name="France"><region name="Occitanie (région administrative)"><name sortKey="Sajous, Franck" sort="Sajous, Franck" uniqKey="Sajous F" first="Franck" last="Sajous">Franck Sajous</name>
</region>
<name sortKey="Chudy, Yannick" sort="Chudy, Yannick" uniqKey="Chudy Y" first="Yannick" last="Chudy">Yannick Chudy</name>
<name sortKey="Gaume, Bruno" sort="Gaume, Bruno" uniqKey="Gaume B" first="Bruno" last="Gaume">Bruno Gaume</name>
<name sortKey="Navarro, Emmanuel" sort="Navarro, Emmanuel" uniqKey="Navarro E" first="Emmanuel" last="Navarro">Emmanuel Navarro</name>
<name sortKey="Prevot, Laurent" sort="Prevot, Laurent" uniqKey="Prevot L" first="Laurent" last="Prévot">Laurent Prévot</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/TlfNancyV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000017 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000017 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= TlfNancyV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:90792A953C1E3C5D11D0476C5E6873AE5BF8D8B1 |texte= Semi-automatic Endogenous Enrichment of Collaboratively Constructed Lexical Resources: Piggybacking onto Wiktionary }}
This area was generated with Dilib version V0.6.39. |